Clustering algorithm for data analysis of the Fiber detectors of the HypHI project

نویسندگان

  • O. Borodina
  • V. Bozkurt
  • E. Kim
  • F. Maas
  • S. Minami
  • D. Nakajima
  • T. R. Saito
  • W. Trautmann
چکیده

The HypHI project aims to study hypernuclei by means of collisions of stable heavy ion and RI beams on stable target materials. As the first step (Phase 0), the feasibility of hypernuclear spectroscopy with heavy ion beams was investigated with a Li beam at 2 A GeV impinged on a C target by identifying ΛH, 4 ΛH and 5 ΛHe hypernuclei [1]. The Phase 0 experiment was performed in August and October 2009. In addition a new experiment has been performed in March 2010, in which a Ne beam was impinged on a C target at an energy of 2 A GeV. A dedicated algorithm for hit clustering has been considered to replace the current algorithm which is not flexible enough to handle the experimental behavior of the fiber detectors. The new algorithm should be able to handle channel gaps within a hit cluster which corresponds to the cross talk behavior of the PMT of the fiber detectors. As well outliners, corresponding to noisy channels, should not affect the clustering procedure and could be detected for possible rejection. The algorithm should also provide good clustering whatever is the detector occupancy, which is the fraction of fired channels over the total number of channels of considered fiber layer. The most important feature of the algorithm is that it should not require the number of clusters as an input since event by event the hit pattern on detectors are different, so that prior assumption is impossible. The proposed algorithm is based on Hierarchical clustering algorithm which consists in agglomerate fired channels using the distance matrix as clustering criteria. The definition of the distance gives different behavior on the clustering procedure. The complete-link method is used, in which the distance corresponds to the maximum euclidian distance between pairwise channels, producing compact clusters [2]. The use of a goodness criterion allows to assess the quality of each cluster and of the full hit partition. By using this kind of criteria, produced hit clusters can be merged in order to improve the global goodness of the clustering context. The number of clusters is then obtained when the criterion is optimal. The algorithm uses the silhouette factor, s(i), proposed in [3]. For each channel i:

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Improved SSPCO Optimization Algorithm for Solve of the Clustering Problem

Swarm Intelligence (SI) is an innovative artificial intelligence technique for solving complex optimization problems. Data clustering is the process of grouping data into a number of clusters. The goal of data clustering is to make the data in the same cluster share a high degree of similarity while being very dissimilar to data from other clusters. Clustering algorithms have been applied to a ...

متن کامل

An Improved SSPCO Optimization Algorithm for Solve of the Clustering Problem

Swarm Intelligence (SI) is an innovative artificial intelligence technique for solving complex optimization problems. Data clustering is the process of grouping data into a number of clusters. The goal of data clustering is to make the data in the same cluster share a high degree of similarity while being very dissimilar to data from other clusters. Clustering algorithms have been applied to a ...

متن کامل

A Fuzzy C-means Algorithm for Clustering Fuzzy Data and Its Application in Clustering Incomplete Data

The fuzzy c-means clustering algorithm is a useful tool for clustering; but it is convenient only for crisp complete data. In this article, an enhancement of the algorithm is proposed which is suitable for clustering trapezoidal fuzzy data. A linear ranking function is used to define a distance for trapezoidal fuzzy data. Then, as an application, a method based on the proposed algorithm is pres...

متن کامل

Persistent K-Means: Stable Data Clustering Algorithm Based on K-Means Algorithm

Identifying clusters or clustering is an important aspect of data analysis. It is the task of grouping a set of objects in such a way those objects in the same group/cluster are more similar in some sense or another. It is a main task of exploratory data mining, and a common technique for statistical data analysis This paper proposed an improved version of K-Means algorithm, namely Persistent K...

متن کامل

A Clustering Approach by SSPCO Optimization Algorithm Based on Chaotic Initial Population

Assigning a set of objects to groups such that objects in one group or cluster are more similar to each other than the other clusters’ objects is the main task of clustering analysis. SSPCO optimization algorithm is anew optimization algorithm that is inspired by the behavior of a type of bird called see-see partridge. One of the things that smart algorithms are applied to solve is the problem ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011